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Abstract 

Let Ln = Lmbm (Xi, . . . , Xn', Yi, . . . , Yn) be the minimum length of a 
bipartite matching between two sets of points in R d , where X\, . . . , Xn, ■ ■ ■ 
and Yi, . . . , Yn, ■ ■ ■ are random points independently and uniformly dis- 
tributed in [0, l] d . We prove that for d > 3, Ln /N ' converges with 
probability one to a constant Pmbm{(1) > as N — > oo. 

1 Introduction and statement of the result. 

Given two sets of ./V points X — {Xi, Xn} and Y — {Yi, Yn} in R d , 
a bipartite matching of X and Y is a perfect matching M on the set X UY, 
such that each pair in M is made of one point of X and one point of Y. The 
length of such a matching is defined to be the sum of the euclidean lengths of 
the edges formed by its pairs. The (euclidean) minimum bipartite matching 
problem (MBMP) then asks one to find a bipartite matching of X and Y whose 
length is as small as possible. We shall denote by Lmbm(X, Y) the length of a 
minimum bipartite matching of X and Y . 

A related problem is the simple minimum matching problem (MMP), where 
one is asked to find a perfect matching of smallest euclidean length on a set 
X = {Xi, Xn} C R d . The subadditive methods inaugurated by Beardwood, 
Halton and Hammersley (BHH) j|] and further developed in || |o[|l^|, show 
that a strong limit theorem applies to the length Lmm(X) of a simple mini- 
mum matching on X, when the points X±, . . . , Xn are random. The theorem 
states that for any dimension d, if X\, . . . , Xn, ... is a sequence of points dis- 
tributed independently and uniformly in a bounded region ft C R d , then the 
ratio Lmm{X\, . . .Xn)/N 1 ~ 1 /' 1 converges almost surely to Vo\(^l) 1 ^ d j3MM{d), 
where Vol(f2) denotes the Lebesgues measure of Q, and 0Mu{d) > is a univer- 
sal constant depending only upon d. 



The functional Lmbm does not satisfy this form of limit theorem in dimen- 
sions 1 and 2. For d = 1, the MBMP amounts to a sorting problem and it is 
not difficult to show that if X and Y both consist of N points independently 
and uniformly distributed in [0, 1], there are constants < C\ < C 2 such that 
CiVN < L M bm{X,Y) < C 2 Vn with probability 1 - o(l) as N -> 00. More- 
over in that case the variance of Lm b m (X, Y) / VA does not converge to zero as 
N — > 00. (Lmbm is not "self-averaging", in the statistical physics' terminology.) 
For d = 2 Ajtai et al. |Q] proved a remarkable fact: if the sets X, Y are now 
distributed in [0, l] 2 , then for some constants C\,C2 indendent of N, one has 
Ciy/NlogN < L M bm(X,Y) < C 2 y/N\ogN with probability l-o(l). Numeri- 
cal simulations suggest that Lmbm{X,Y) / \J N log N converges to a non-random 
constant as N — ► 00, however this has not yet been proved. 

In this article, we show that for any d > 3 we recover a BHH theorem for 
the functional Lmbm- 

Theorem 1.1 Let X\, Xn, ■ ■■ and Y\, ...,Y/v , ■•■ be two sequences of random 
points independently and uniformly distributed in [0, l] d , where d > 3, and let 
Ln = Lmbm(Xi, . . . , Xn] Y\, . . . , Yn). There exists a constant (3m bm {d) > 
such that with probability one 

lim Ln/N 1 - 1 /* = 0MB M (d). 

iv— >oo 

2 Proof of Theorem |1 . 1| . 

To begin, we remark that to prove this theorem it will suffice to establish that 
Ln /N 1 ~ 1 / d converges in mean value to a constant /3mbm(^)- This is a conse- 
quence of the following lemma p^ |: 

Lemma 2.1 For any t > 0, one has 

Ln Lm ni v , A"^ 2/dj.2 

p (l^ - ^(^)l > «) ^ 2 -p( — §5— )• 

This result follows from the application of Azuma's inequality |}| and the mar- 
tingale difference method to Ln, in a way by now standard in the probabilistic 
theory of combinatorial optimisation ||13|| . Given the lemma, the theorem fol- 
lows easily from the convergence of ELn/N 1 ^ 1 ^ as A — > 00, by applying the 
Borel-Cantelli lemma. 

We have now to establish that for d > 3 the quantity ELn/N 1 ^ 1 ^ indeed 
converges to a constant (3m b m (di) > 0. To prove this we exploit the subadditiv- 
ity properties of Lmbm, in the spirit of Steele's theory of subadditive Euclidean 
functionals Jl^]. Let us divide the unit cube [0, l] d into disjoint similar sub- 
cubes Qk, k = 1, . . . , m d with edges of length 1/m, and compare the value of 
Lmbm(X,Y) to the sum 

fe=i 



2 



where Lk is the value of the functional Lmbm for the set of points Xi and Yi 
which belongs to Qk- A difficulty arises as in general the Qfe's do not contain 
the same number of points Xi and of points Yi . (In fact the special properties 
of the MBMP in dimensions 1 and 2 originate from the fluctuations of the 
differences between these numbers around their mean value 0.) To give meaning 
to the sum (Q) we need to generalize the functional Lmbm to matchings between 
two sets of different cardinalities. There are several ways to do this; we shall 
define Lmbm{X\, . . . Xn 1 ; Yi,... Y^ 2 ) by imposing that the minimum matching 
contains as few unmatched points as possible. That is if N\ > N 2 , we leave 
N\ — N 2 points of X unmatched, whereas if N% < N 2 we leave N 2 — N\ points 
of Y unmatched. 

Although expression ([j]) now makes sense, it is still not possible to write a 
subadditivity inequality of the same form as the one studied in Jl^|. Indeed, such 
a form (which Steele calls "geometric subadditivity") implies an upper bound of 
the form CW 1_1 / d for the functional at hand and it is easy to see that 
no such bound applies to Lmbm{X,Y). We shall however see that a geometric 
subadditivity property holds in the mean for the functional Lmbm- Suppose 
that the points X\ , . . . X^ 1 , Y\ , . . . Y/v 2 belong to an arbitrary cube Q having 
edge length a, and divide Q into disjoint cubes Q pi p — 1, . . .2 d by splitting 
each edge in two halves. Construct in each Q p an optimal matching in the sense 
just defined, between the points Xi and the n 2iP points Yi in Q p , and denote 
its length by L p . The points that are left unpaired are in number |ni, p — n 2 , p \ 
in each Q p , so if Lg denotes the length of an optimal matching for these points 
one has 

Lmbm (Xi , . . . Xn ± ', Y\ , ■ ■ ■ , Yn 2 ) < 2_j + ^° 

P =i 

<22L p + -aVd22\ni,p-n2,p\, (2) 
p— i p— i 

where the last inequality is obtained by bounding Lq in an obvious way. 

We shall apply this to Q = [0, l] d . Let Q Pl p% = 1, ... 2 d be the cubes 
obtained in the above subdivision; let Q PlP2 be the cubes obtained by splitting 
in two halves the edges of each cube Q Pl ; and so on. By repeating this operation 
K times, we get a subdivision with cubes Q Pl ... PK whose edges are of length 
1/2 K . Let ni^ Pl ... PK and n 2 ^ Pl ... PK be respectively the number of points Xi and 
Yi in Q Pl ... PK . Apply (||) first to the Q Pl ,... PK _ 1 , s j then to the Q Pl ... PK _ 2 's, etc, 
keeping at each step only those points which are still unpaired. It is easy to 
convince oneself that the number of unpaired points in each Q Plt .., PK _ k just after 
step k is given by \ni tPx ^.. PK _ h — n 2 ^ pi ^,, PK k \. After step k — K one obtains a 
matching between X\ , . . . Xn 1 and Y\ , . . . Yjv 2 where all the points but \Ni — N 2 \ 
are matched. One is thus led to the following inequality: 

Lmbm(Xi, . . . Xjfi, Yi, . . . Yjv 2 ) < ^ L Pi ,„ Pk 
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K n 

+ ^2^k H \ n l,Pl- Pk - n 2,Pi- Pk \- ( 3 ) 
k=l pi...pk 

We now proceed to derive a subadditivity property for the mean value of 
Lmbm{X,Y). We first consider the case where Ni = cardX and N2 = cardY 
are not fixed integers but are independent Poisson random variables with the 
same mean value N, the elements of X and Y being chosen independently and 
uniformly in [0, l] d . For a given k, the numbers ni iPlt ... Pk and ri2, Pl: ... Pk are 
then also independent Poisson random variables, with parameter N/2 kd . Let 
M(N) = EL MBM (X 1 , . . . X Nl ;Yi, . . . Yjv 2 ). It is immediate by homogeneity 
that 

EL P1 ... PK = 2- K M(N/2 Kd ). (4) 
Moreover from the well known properties of Poisson variables we have 

/ TV \l/2 

E\m, Pl ... Pk -n 2 , Pl ... Ph \ < v J • (5) 
By taking mean values in (||) we obtain: 

K 

M(N) < 2 K{d - 1) M{N/2 Kd ) + V2dNj2 2 k{d,2 - 1) . (6) 

k=l 

This inequality has been obtained for a subdivision of [0, l] d which consists in 
2 Kd similar cubes. Suppose now that we start from the subdivision X in m d 
similar cubes Qk k — 1, . . . m d , where m is an arbitrary integer. One can then 
reproduce the previous construction in the following manner. Let m = 2 + r 
where < r < 2 K . Consider the cube Qo = [0, 2 K+1 /m] d and form the natural 
subdivision So of Qo by 2^ K+1 ^ d cubes Q Po ,... PK whose edges have length 1/m. 
We can proceed with Qo and So to a K + 1 steps construction similar to the one 
which led to (j|). The only differences are that Qo has edges of length 2 K+1 /m 
rather than 1, and that some of the Q Po ... PK 's, namely those which belong to 
So but not to S, are empty. Nevertheless, we may write 

Lmbm (Xi , . . . Xn! ;Yi,..., Yn 2 ) — ^ Lk 

p=i 

J^Vd2 K ~ k ^ . 
^Z^ l n i*o-P* - najw-pJ 

k=0 Po—Pk 

K n 

-J2^k \ n ^P0- Pk - n 2,Po- Pk \- (7) 

fc=0 Po— Pit 

Now n\ >po ... Pk and n% >po ... Ph are Poisson variables with parameter lower than 
2iK-k)d N i m d < 2 ~ kd N so we still have 

N \i/2 



E\n 1 , Po ... Pk -n 2 , Po ... Pk \<V2^^ s J . (8) 
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Taking average values one is led to 



K 



M(N) < m d - 1 M{N/m d ) + 2 d V2dN^2 2 fc(d/2 ~ 1) . (9) 



k=0 

Dividing this last inequality by N 1 ^ 1 ^ and then replacing N by m d N, we get 

M{m d N) M(N) 2 d V2d A fe(d/2 _ x) 
(m^iV) 1 - 1 /^ - jyi-i/d iVV2-i/d ■ 

If d > 2, the sum on the r.h.s. of the last inequality is bounded above indepen- 
dently of N, and is divided by a positive power of N. Elementary analysis now 
shows that the ratio M{N)/N 1 ~ 1 / d necessarily converges to a limit (3mbm(<I) as 
N -> oo. Indeed, let f(t) = M(i d )/t d-1 . One verifies at once that /(t) satisfies 

f{mt)<f{t)+C/t d / 2 - 1 (11) 

for alH > and any integer to; /(£) is continuous, since M(N) is a continuous 
function of N. So the expression f[t) + Cd/t^ 1 is bounded in [L2] and since 
[1, oo[ is the union of the intervals m[l, 2], to > 1, it follows from (11) that fit) 
remains bounded as t — > oo, thus lim* f(t) < oo. Now define /3 = lim*/(i). 
For any e > 0, chose t > 1 and 77 > such that /(£) + C d /t d/2 ^ 1 < (3 + e 
for i in the interval 7 = [to — r),to + 77]. Since the intervals ml, m > 1 span a 
whole interval [^4, 00 [ for an A sufficiently large, it follows again from ( |Ti] ) that 
lim* f(t) </? + e. Since e is arbitrary one has lim* f(t) = j3, hence f(t) — » (3 as 
i -> 00, from which it follows that lim^oc M{N)/N l - l / d = /3. Q.E.D. 
We have thus shown for d > 3, that one has 

EL M bm(X u . . .,X Nl ;Yi, . . . ,Y Na ) ~ ^bmpM^ 1 - 1 ^, JV - 00 (12) 

when iVi and AT 2 are independent Poisson variables with parameter N. The 
same result for the mean value ELn, where TV is a fixed integer, follows then 
easily. Indeed, we have the obvious bound 

\Lmbm(Xi, . . . Xn',Yi, . . . Yn) — Lmbm(Xi, . . . Xjf^Yx, . . . Yjy 2 )\ 

< Vd(\Ni-N\ + \N a -N\), (13) 

whence taking mean values, 

\EL N -EL MBM (X 1 ,...X Nl ;Y 1 ,...Y N2 )\<2V2dN, (14) 
and dividing by N 1 ^ 1 ^ we deduce that 

jfe J^TT/d pMBM{d). (15) 



Theorem 1.1 is now proved. 
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3 Concluding remarks. 



1) Our decimation procedure does not give back the bounds proven by Ajtai et 
of. in d = 2, but a weaker 0(VW In N) bound. It is believed that a self-averaging 
theorem applies also to the functional Lmbm in dimension 2 

2) The estimation of the constants [3m bm {d) is also an interesting p roblem . A 
remarkable result of Talagrand shows that one has 0mbm{<I) = \J d/2en(l + 
0(lad/d)) as d — > oo. It is conjectured that a 1/d series expansion actually exists 
for Pmbm{cL). 

3) Mezard and Parisi have obtained detailed analytic predictions for the ran- 
dom link versions of the MMP and the MBMP ||, where the distance matrix 
between the points X, and Yj is replaced by a matrix of independent and identi- 
cally distributed entries. (Some of these predictions, for the random assignment 
problem, have been proven recently by Aldous ||.) Numerical studies || [?J 
indicate that for the MMP and the MBMP, the random link model provides 
one with a very good "mean-field" approximation to the Euclidean model in the 
large d limit. Except for simpler combinatorial problems however [B, very few 
rigorous results are known for comparing the euclidean and the random link 
models. 
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